A genome-wide approach for detecting novel insertion-deletion variants of mid-range size
نویسندگان
چکیده
We present SWAN, a statistical framework for robust detection of genomic structural variants in next-generation sequencing data and an analysis of mid-range size insertion and deletions (<10 Kb) for whole genome analysis and DNA mixtures. To identify these mid-range size events, SWAN collectively uses information from read-pair, read-depth and one end mapped reads through statistical likelihoods based on Poisson field models. SWAN also uses soft-clip/split read remapping to supplement the likelihood analysis and determine variant boundaries. The accuracy of SWAN is demonstrated by in silico spike-ins and by identification of known variants in the NA12878 genome. We used SWAN to identify a series of novel set of mid-range insertion/deletion detection that were confirmed by targeted deep re-sequencing. An R package implementation of SWAN is open source and freely available.
منابع مشابه
PRISM: Pair-read informed split-read mapping for base-pair level detection of insertion, deletion and structural variants
MOTIVATION The development of high-throughput sequencing technologies has enabled novel methods for detecting structural variants (SVs). Current methods are typically based on depth of coverage or pair-end mapping clusters. However, most of these only report an approximate location for each SV, rather than exact breakpoints. RESULTS We have developed pair-read informed split mapping (PRISM), ...
متن کاملGene Copy-Number Polymorphism Caused by Retrotransposition in Humans
The era of whole-genome sequencing has revealed that gene copy-number changes caused by duplication and deletion events have important evolutionary, functional, and phenotypic consequences. Recent studies have therefore focused on revealing the extent of variation in copy-number within natural populations of humans and other species. These studies have found a large number of copy-number varian...
متن کاملStructural Variation Discovery and Genotyping from Whole Genome Sequencing: Methodology and Applications: A Dissertation
A comprehensive understanding about how genetic variants and mutations contribute to phenotypic variations and alterations entails experimental technologies and analytical methodologies that are able to detect genetic variants/mutations from various biological samples in a timely and accurate manner. High-throughput sequencing technology represents the latest achievement in a series of efforts ...
متن کاملI-38: Chromosome Instability in The Cleavage Stage Embryo
Recently, we demonstrated chromosome instability (CIN) in human cleavage stage embryogenesis following in vitro fertilization (IVF). CIN not necessarily undermines normal human development (i.e. when remaining normal diploid blastomeres develop the embryo proper), however it can spark a spectrum of conditions, including loss of conception, genetic disease and genetic variation development. To s...
متن کاملGenome-wide Association: “A Revolutionary Approach”
Genome-Wide Association studies (GWAS) have brought a revolutionary change or paradigm shift in detecting novel variants for complex disorders and shifting the burden of finding the biological relevance of these newly discovered variants on biochemists and physiologists, hence it is a movement from forward to reverse genetics. Here we discuss the role of such studies with GWAS designs from anth...
متن کامل